Towards Enhanced Identification of Emotion from Resource-Constrained Language through a novel Multilingual BERT Approach
نویسندگان
چکیده
Emotion identification from text has recently gained attention due to its versatile ability analyze human-machine interaction. This work focuses on detecting emotions textual data. Languages, like English, Chinese, and German are widely used for classification, however, limited research is done resource-poor oriental languages. Roman Urdu (RU) a resource-constrained language extensively across Asia. predicting RU text. For this, dataset collected different social media domains based Paul Ekman's theory it annotated with six basic emotions, i.e., happy, surprise, angry, sad, fear, disgusting. Dense word embedding representations of languages adopted that utilize existing pre-trained models. BERT additionally fine-tuned the classification task. The proposed approach compared baseline machine learning deep algorithms. Additionally, comparison current also performed approaches same Based empirical evaluation, performs better than state-of-the-art an average accuracy 91%.
منابع مشابه
Enhancing Multilingual Recognition of Emotion in Speech by Language Identification
We investigate, for the first time, if applying model selection based on automatic language identification (LID) can improve multilingual recognition of emotion in speech. Six emotional speech corpora from three language families (Germanic, Romance, Sino-Tibetan) are evaluated. The emotions are represented by the quadrants in the arousal/valence plane, i. e., positive/negative arousal/valence. ...
متن کاملTowards a Language Independent Encoding of Documents: A Novel Approach to Multilingual Question Answering
Given source text in several languages, can one answer queries in some other language, without translating any of the sources into the language of the questioner? In this paper we try to address this question as we report our work on a restricted domain, multilingual Question – Answering system, with current implementations for source text in English and questions posed in English and Hindi. Th...
متن کاملRclp: a Novel Approach for Resource-constrained Loop Pipelining Rclp:a Novel Approach for Resource-constrained Loop Pipelining 3
In this paper a novel technique for resource-constrained loop pipelining is presented. RCLP is based on several dependence graph operations: loop unrolling, operation retiming, resource-constrained scheduling, and span reduction. All these operations are focused to nd a minimum length schedule able to be executed with a limited number of resources and thus maximizing resource utilization. Exper...
متن کاملMultilingual native language identification
We present the first study of Native Language Identification (NLI) applied to text written in languages other than English, using data from six languages. NLI is the task of predicting an author’s first language (L1) using only their writings in a second language (L2), with applications in Second Language Acquisition and forensic linguistics. Most research to date has focused on English but the...
متن کاملLanguage Identification in Multilingual Documents
Most optical character recognition (OCR) systems can recognize at most a few languages. For large archives of document images that contain different languages, there must be some way to automatically categorize these documents before applying the proper OCR on them. This report presents a research in the identification of English, Chinese, Malay and Tamil in image documents. While most other wo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Asian and Low-Resource Language Information Processing
سال: 2023
ISSN: ['2375-4699', '2375-4702']
DOI: https://doi.org/10.1145/3592794